Boosting the prediction and understanding of DNA-binding domains from sequence

Authors

  • Robert E. Langlois
  • Hui Lu
Abstract

DNA-binding proteins perform vital functions related to transcription, repair and replication. We have developed a new sequence-based machine learning protocol to identify DNA-binding proteins. We compare our method with an extensive benchmark of previously published structure-based machine learning methods as well as a standard sequence alignment technique, BLAST. Furthermore, we elucidate important feature interactions found in a learned model and analyze how specific rules capture general mechanisms that extend across DNA-binding motifs. This analysis is carried out using the malibu machine learning workbench available at http://proteomics.bioengr.uic.edu/malibu and the corresponding data sets and features are available at http://proteomics.bioengr.uic.edu/dna.

Similar resources

In silico investigation of lactoferrin protein characterizations for the prediction of anti-microbial properties

Lactoferrin (Lf) is an iron-binding multi-functional glycoprotein which has numerous physiological functions such as iron transportation, anti-microbial activity and immune response. In this study, different in silico approaches were exploited to investigate Lf protein properties in a number of mammalian species. Results showed that the iron-binding site, DNA and RNA-binding sites, signal pepti...

P-84: Characterization of Androgen Receptor Structure and Nucleocytoplasmic Shuttling of the Rice Field Eel

Background: Androgen receptor (AR) plays a critical role in prostate cancer and male sexual differentiation. The mechanisms by which AR acts and the regulation of AR nucleocytoplasmic shuttling are not well understood. Materials and Methods: Degenerate PCR and RACE cloning of the AR gene; phylogenetic analysis and molecular modeling; real-time fluorescent quantitative RT-PCR; Northern blot hybridization; In ...

Prediction of 3D protein Structure based on Mutation of AKAP3 and PLOD3 Gene in Case of Non-Obstructive Azoospermia

Background: The present study was designed to evaluate A-kinase anchoring protein 3 (AKAP3) and procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 (PLOD3) gene mutations and to predict 3D protein structure for ligand-binding activity in non-obstructive azoospermic males. Materials and Methods: Clinically diagnosed cases of non-obstructive azoos...

A Threading-Based Method for the Prediction of DNA-Binding Proteins with Application to the Human Genome

Diverse mechanisms for DNA-protein recognition have been elucidated in numerous atomic complex structures from various protein families. These structural data provide an invaluable knowledge base not only for understanding DNA-protein interactions, but also for developing specialized methods that predict the DNA-binding function from protein structure. While such methods are useful, a major lim...

Designing, Optimization and Construction of Myelin Basic Protein Coding Sequence Binding to the Immunogenic Subunit of Cholera Toxin

Background and Objectives: Multiple sclerosis (MS) is a chronic inflammatory autoimmune disease. Mucosal feeding of myelin basic protein bound to the cholera toxin B subunit can reduce the intensity of the immune response in MS patients. The expression system, the domain composition of the fusion protein, accessibility of two domains, codon adaptation index (CAI) and GC contents are v...

Journal title:

Volume 38, Issue

Pages -

Publication date: 2010